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Abstract 

An axis-parallel /c-dimensional box is a Cartesian product R\ X R% X ■ ■ ■ X Rk 
where Ri (for 1 < i < k) is a closed interval of the form [aj,6j] on the real line. 
For a graph G, its boxicity box(G) is the minimum dimension k, such that G is 
representable as the intersection graph of (axis-parallel) boxes in fc-dimensional 
space. The concept of boxicity finds applications in various areas such as ecology, 
operations research etc. 

A number of NP-hard problems are either polynomial time solvable or have much 
better approximation ratio on low boxicity graphs. For example, the max-clique 
problem is polynomial time solvable on bounded boxicity graphs and the maximum 
independent set problem for boxicity 2 graphs has a 1 + - log n approximation 
ratio for any constant c. In most cases, the first step usually is computing a low 
dimensional box representation of the given graph. Deciding whether the boxicity 
of a graph is at most 2 itself is NP-hard. 

We give an efficient randomized algorithm to construct a box representation of 
any graph G on n vertices in [(A + 2) Inn] dimension, where A is the maximum 
degree of G. This algorithm implies that box(G) < [(A + 2) Inn] for any graph G. 
Our bound is tight up to a factor of Inn. 

We also show that our randomized algorithm can be derandomized to get a poly- 
nomial time deterministic algorithm. 

Though our general upper bound is in terms of maximum degree A, we show 
that for almost all graphs on n vertices, their boxicity is 0(d av lnn) where d av is 
the average degree. 

Key words: Boxicity, randomized algorithm, derandomization, random graph, 
intersection graphs 
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1 Introduction 



Let T = {S x C U : x G V} be a family of subsets of a universe U, where 
V is an index set. The intersection graph A(jF) of T has V as vertex set, 
and two distinct vertices x and y are adjacent if and only if S x fl S y 7^ 0. 
Representations of graphs as the intersection graphs of various geometrical 
objects is a well studied topic in graph theory. Probably the most well studied 
class of intersection graphs are the interval graphs, where each S x is a closed 
interval on the real line. 

A well known concept in this area of graph theory is the boxicity, which was 
introduced by F. S. Roberts in 1969 |16j . This concept generalizes the concept 
of interval graphs. A /c-dimensional box is a Cartesian product R\ x R 2 x 
■ ■ ■ x Rk where Ri (for 1 < i < k) is a closed interval of the form [a,, bj\ on 
the real line. For a graph G, its boxicity is the minimum dimension k, such 
that G is representable as the intersection graph of (axis-parallel) boxes in 
/c-dimensional space. We denote the boxicity of a graph G by box(G). The 
graphs of boxicity 1 are exactly the class of interval graphs. The boxicity of a 
complete graph is by definition. 

It was shown by Cozzens [10] that computing the boxicity of a graph is NP- 
hard. This was later improved by Yannakakis [21], and finally by Kratochvil 
[T5] who showed that deciding whether the boxicity of a graph is at most 2 
itself is NP-complete. 

In many algorithmic problems related to graphs, the availability of certain 
convenient representations turns out to be extremely useful. Probably, the 
most well-known and important examples are the tree decompositions and 
path decompositions [5] . Many NP-hard problems are known to be polynomial 
time solvable given a tree(path) decomposition of bounded width for the input 
graph. Similarly, the representation of graphs as intersections of "disks" or 
"spheres" lies at the core of solving problems related to frequency assignments 
in radio networks, computing molecular conformations etc. For the maximum 
independent set problem which is hard to approximate within a factor of 
n ( l / 2 )-£ f or general graphs [13], a PTAS is known for disk graphs given the disk 
representation [TlfT] . In a similar way, the availability of box representation 
in low dimension make some well known NP hard problems polynomial time 
solvable. For example, the max-clique problem is polynomial time solvable in 
boxicity k graphs since there are only 0((2n) h ) maximal cliques in such graphs. 
It was shown in [T3] that the complexity of finding the maximum independent 
set is hard to approximate within a factor n <yl / 2 ' > ~ t for general graphs. In fact, 
[T3] gives the stronger inapproximability result of n 1_<E , for any e > 0, under 
the assumption that NP ^ ZPP. Though this problem is NP-hard even for 
boxicity 2 graphs, it is approximable to a logn factor for boxicity 2 graphs 
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given a box representation [2lH] . 

Thus, it is interesting to design efficient algorithms to represent small boxicity 
graphs in low dimensions. Roberts [TB] had given a general upper bound of 
n/2 for the boxicity of any graph on n vertices. In this paper, we show an 
upper bound of [(A + 2) Inn] for the boxicity for any graph G on n vertices 
and having maximum degree A by giving a randomized algorithm that yields a 
box representation for G in [(A + 2) Inn] dimension in 0(An 2 In 2 n) time with 
high probability. We also derandomize our randomized algorithm and obtain 
a deterministic polynomial time algorithm to do the same. Very recently, the 
authors had shown that for any graph G with maximum degree A, box(G) < 
2 A 2 [23J. However, it may be noted that the result in this paper yields better 
bounds on boxicity for graphs where A > Inn. 

In a recent manuscript [7J the authors showed that for any graph G, box(G) < 
tw(G) + 2, where tw(G) is the treewidth of G. This result implies that the class 
of 'low boxicity' graphs properly contains the class of 'low treewidth graphs'. 
It is well known that almost all graphs on n vertices and m = cn edges (for a 
sufficiently large constant c) have Q(n) treewidth [14J. In this paper we show 
that almost all graphs on n vertices and m edges have boxicity 0(d av Inn) 
where d av = 2m/ n. An implication of this result is that for almost all graphs 
on m = cn edges, there is an exponential gap between their boxicity and 
treewidth. Hence it is interesting to reconsider those NP-hard problems that 
are polynomial time solvable in bounded treewidth graphs and see whether 
they are also polynomial time solvable for bounded boxicity graphs. 

Researchers have also tried to bound the boxicity of graph classes with special 
structure. Scheinerman [I7J showed that the boxicity of outer planar graphs 
is at most 2. Thomassen [19] proved that the boxicity of planar graphs is 
bounded above by 3. Upper bounds for the boxicity of many other graph 
classes such as chordal graphs, AT-free graphs, permutation graphs etc. were 
shown in [7J by relating the boxicity of a graph with its treewidth. Researchers 
have also tried to generalize or extend the concept of boxicity in various ways. 
The poset boxicity [20J, the rectangle number [8], grid dimension [3], circular 
dimension [T21T8] and the boxicity of digraphs [9] are some examples. 

1.1 Our Results 

We summarize below the results of this paper. 

(1) We show that for any graph G on n vertices with maximum degree A, 
box(G) < [(A + 2) Inn] . This bound is tight up to a factor of 2 Inn. 

(2) In fact, we show a randomized algorithm to construct a box representation 
of G in [(A + 2) Inn] dimension, that runs in 0(An 2 In 2 n) time with high 
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probability. 

(3) Next we show a polynomial time deterministic algorithm to construct 
a box representation in [(A + 2) Inn] dimension by derandomizing the 
above randomized algorithm. 

(4) Though the general upper bound that we show is in terms of the max- 
imum degree A, we also investigate the relation between boxicity and 
average degree. We show that for almost all graphs on n vertices and 
m edges, the boxicity is 0(d av \nn), where d av = 2m/n is the average 
degree. 

(5) We also derive an upper bound for boxicity in terms of m and n. We 
show that for any connected graph G, box(G) < h\/m\nn, which is tight 
up to a factor of byhin for a constant b. 



1.2 Definitions and Notations 

Let G be an undirected simple graph on n vertices. The vertex set of G is 
denoted as V(G) = {1, • • • ,n} (or V in short). Let E{G) denote the edge 
set of G. We denote by G, the complement of G. We say that the edge e is 
missing in G, if e G E{G). A graph G' is said to be a super graph of G where 
V(G) = V(G'), if E(G) C E(G'). For a vertex u G V, let N(u) denote the 
set of neighbors of u in G i.e., N(u) = {v G V \ u ^ v and (u,v) G E(G)}. 
Let d{u) denote the degree of u in G, i.e. d{u) = \N(u)\ . Let A denote the 
maximum degree of G. 

Definition 1 (Projection) Let n be a permutation of the set {1, . . . , n}. Let 

X C The projection of n onto X denoted as ttx is defined as 

follows. Let X = {ui, . . . ,u r } such that 7r(ui) < n(u 2 ) < ... < n(u r ). Then 
■k x {ui) = 1, 7T X (u 2 ) = 2,..., ir x (u r ) = r. 

Definition 2 (Interval Representation) An interval graph can be repre- 
sented as the intersection graph of closed intervals on the real line JR. Such 
a representation is called an interval representation. Equivalently and in fact 
more precisely, an interval representation of an interval graph G can be de- 
fined by two functions I : V — > 1R and r : V — > 1R. The interval corresponding 
to a vertex v denoted as I(v) is given by [l(v),r(v)\, where l{v) and r(v) are 
the left and right end points of the interval corresponding to v. 

Definition 3 Given a graph G and a permutation n of its vertices {1, . . . , n}, 
we define a map M.(G,ix), which associates to the pair (G,ir), an interval 
supergraph G' of G, as follows: consider any vertex u G V(G). Let n u G 
N(u)U{u} be the vertex such thatn{n u ) = mm weN ( u ) U { u y n(w). Then associate 
the interval [n(n u ), tt(u)] to the vertex u. Let G' be the resulting interval graph. 
It is easy to verify that G' is a super graph of G. We define M.{G, it) = G' . 
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1.3 Box Representation and Interval Graph Representation 

Let G be a graph and let Ix, . . . , i& be k interval graphs such that each Ij is 
defined on the same set of vertices V. That is, V(Ij) = V{G) for j — 1, . . . , k. 
If 

E(G) = E(ii) n ■ ■ • n E(i k ), 

then we say that 1%, . . . , If; is an interval graph representation of G. The fol- 
lowing equivalence is well-known. 

Theorem 4 (Roberts |16j ) Let G be a simple, undirected, non-complete 
graph. Then, the minimum k such that there exists an interval graph rep- 
resentation of G using k interval graphs I\,...,If. is the same as box(G). 

Recall that a fc-dimensional box representation of G is a mapping of each 
vertex u G V to R±(u) x ■ ■ • x R)~{u), where each Ri(u) is a closed interval 
of the form [£i(u), rj(-u)] on the real line. It is straightforward to see that 
an interval graph representation of G using k interval graphs is 
equivalent to a fc-dimensional box representation in the following sense. Let 
Riiu) = [£i(u), ri(u)} denote the closed interval corresponding to vertex u in 
an interval representation of Jj. Then the /c-dimensional box corresponding 
to u is simply R\{u) x • ■ ■ x Rk(u). Conversely, given a fc-dimensional box 
representation of G, the set of intervals {Ri(u) : u G V} forms the ith interval 
graph li in the corresponding interval graph representation. 

When we say that a box representation in t dimension is output by an algo- 
rithm, the algorithm actually outputs the interval graph representation: that 
is, the interval representation of the constituent interval graphs. 



2 The randomized construction 

Consider the following randomized procedure RAND which outputs an in- 
terval super graph of G. Let A be the maximum degree of G. 



RAND 

Input: G. 

Output: G' which is an interval super graph of G. 
begin 
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stepl. Generate a permutation ir of {1, . . . , n} uniformly at random. 

step2. Return G' = M(G, tt). 

end. 

Lemma 5 Let e = (u,v) <£ E(G). Let G' be the output of RAND (G). Then, 



PROOF. 

We have to estimate the probability that u and v are adjacent in G' . That is, 



Let n u G N{u) U {u} be a vertex such that Tr(n u ) = min weA r(V)u{«} tt(w). Sim- 
ilarly let n v G N(v) U {v } be a vertex such that ir(n v ) = min^^^uM n(w). 

Clearly I{u) = [ir(n u ), tt(u)] and I{y) = [ir(n v ), n(v)]. Thus, it is easy to see 
that I{u) f]I(v) ^ if (a) 7r(n„) < n(v) < n{u) or (b) n(n v ) < n(u) < n(v). 
On the other hand, I(u) P\I(v) ^ only if either (a) or (b) hold. To see this, 
first observe that n(u) ^ 7r(u>) for any w G N(v) U {v } and ir(v) ^ n{w) for 
any w G iV(-u) U {u}. Without loss of generality, let n(u) < tt(v). Now, it is 
obvious that I{u) fl I(y) ^ 0, n(n v ) < n(u) < ir(v) since ir(n v ) ^ n(u). Since 
(a) and (b) are mutually exclusive, 

Pr[e G E(G')\ = Pr[7r(n u ) < ir(v) < ir(u)} + Pr[7r(n„) < ir(u) < n(v)} . 

We bound Pr[7r(n u ) < ir(v) < ir(u)] as follows. Let X = {u} U N(u) U {v}. 
Let Tlx be the projection of n onto X. Clearly, the event n(n u ) < ir(v) < n(u) 
translates to saying that 7Tx(v) < ttx(u) and irx(v) ^ 1. Note that i\x can be 
any permutation of \X\ elements with equal probability, which is urjx+zy ■ The 
number of permutations where ttx(v ) < ttx{ u ) equals (d(u) +2)!/2. Moreover, 
the number of permutations where ttx{v) = 1 equals (d(u) + 1)!. Note that 
the set of permutations with nx{v) = 1 is a subset of the set of permutations 
with 7Tx(v) < n x (u). It follows that 



which is 2<d(u)+2) • Using similar arguments, it follows that 




I(u) f]I(v) ^ 0. 



Pr[nx(v) < ttx(u) and nx(v) ^ 1] 



{d{u) + 2)\/2-{d{u) + l)\ 
(d(u) + 2)\ 




Pr[n(n v ) < ir(u) < tt(v)] 
lows. 



2(d(v)+2) 



. Summing the two bounds, the result fol- 
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Lemma 6 Let I±, I2, ■ • • ,It be the output generated byt invocations of RAND(G). 
Ift > (A + 2) Inn then Pt[E(G) = E{I X ) n E(I 2 ) fl • • • fl E{I t )\ > \. 



PROOF. For e = (u,v) <£ E(G), let Z e denote the event 

e G E{h) f\e G E{I 2 ) . . . f\e G E{I t ) 

That is, Z e denote the event that e G n E(I 2 ) • • • n £7(I t ). Note that 

for any 2,j G {l,...,t}, i 7^ j, the events e G and e G -E'(ij) are 

independent. It follows from Lemma [5] that 



Pr[Z e 



Hence by union bound, 



1] Pr[eG £(/,)] < 



je{i,...,t} 



A 



A + 2 



Pr 



V z e 

e£E(G) 



n 



e<£E(G) 



A 



2 VA + 2 



< (n 2 /2)e^+s 



If we choose t = ( A + 2) In n then the above probability is upper bounded by 
1/2. Recall that E{Ij) C E(G) for all j. It follows that Pr\E(G) = f]j E(Ij) 



Pr 



Ve^E(G) %e 



Thus the result follows. 



Since the set of interval graphs generated by [(A + 2) Inn] invocations of 
RAND(G) is a valid interval graph representation of G with non-zero prob- 
ability as shown above, we have the following corollary. 

Corollary 7 Let G be a graph on n vertices and with maximum degree A. 
Thenbox(G) < [(A + 2) Inn] . 

Lemma 8 The RAND procedure can be implemented in 0(m + n) time as- 
suming that a permutation of {1, ... ,n} can be generated uniformly at random 
in 0(n) time. 



PROOF. The vertex set of G is {1, . . . ,n}. We make the standard assump- 
tion that G is available as an adjacency list representation. First generate 7r 
and store the mapping 7r -1 () in an array indexed from 1 to n. Maintain an- 
other array D indexed from 1 to n such that D[i] = 1 if l(i) for vertex i is 
already defined and D[i] =0 otherwise. We now construct G' = M(G, 71) in n 
steps as follows. In ith step, consider the vertex u = 7r _1 (i). Define r(u) = i. 
If D[u] = then l(u) = i. For each element w G N(u), if D[w] = then define 
l(w) = i and assign D[w] = 1. Thus total time taken in step i is 0{\d{u) \ + 1) 
steps. It follows that the interval representation of G' can be constructed in 
0(m + n) steps. ■ 
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Theorem 9 Given a graph G onn vertices andm edges, with high probability, 
a box representation of G in |~(A + 2) Inn] dimension can be constructed in 
0(An 2 ln 2 n) time, where A is the maximum degree ofG. 

PROOF. We construct an algorithm that takes the graph G as input and 
tries to compute an interval graph representation for G. It repeatedly com- 
putes a set S of [(A + 2) Inn] interval supergraphs of G until it generates a 
set that is a valid interval graph representation of G. If the algorithm fails to 
find such a set after (log 2 e) In n tries, it reports a failure. Each S is computed 
through [(A + 2) Inn] invocations of RAND(G). From lemma [6j the prob- 
ability of failure of this algorithm is at most (l/2)( log2e ' )lnn = 1/n and hence 
it computes a valid interval graph representation of G with high probability. 
Using lemma [HI it is easily verified that to compute a set of (A + 2) Inn inter- 
val graphs, our algorithm takes 0(A(m + n) Inn) time. To verify whether a 
generated set of (A + 2) Inn interval graphs is a valid interval representation 
or not, it takes 0(n 2 A \nn) time. Thus the overall complexity is 0(An 2 In 2 n). 



2.1 Almost tight example 

We remark that for any given A and n > A + 1, we can construct a graph G 
on n vertices and with maximum degree A such that box(G) > [(A + 2)/2j. 
We assume that A is even for the ease of explanation. Roberts [IE] has shown 
that for any even number k, there exists a graph on k vertices with degree 
k — 2 and boxicity k/2. We call such graphs as Roberts graphs. The Roberts 
graph on n vertices is obtained by removing the edges of a perfect matching 
from a complete graph on n vertices. We take such a graph by fixing k = A + 2 
and we let the remaining n — (A + 2) vertices be isolated vertices. Clearly, 
the boxicity of such a graph is also k/2 = (A + 2)/2, whereas the maximum 
degree is A. Thus our upper bound is tight up to a factor of 2 Inn. 



3 Derandomization 

In this section we derandomize the above randomized algorithm to obtain a 
deterministic polynomial time algorithm to output the box representation in 
(A +2) Inn dimensional space for a given graph G on n vertices with maximum 
degree A. 

Lemma 10 Let G = (V, E) be the graph. Let E(G) be the edge set of the 
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complement of G. Let H C E(G). Then we can construct an interval super 
graph G' of G in polynomial time such that \E{G') Pi H\ > -^^\H\. 

PROOF. We derandomize the RAND algorithm to devise a deterministic 
algorithm to construct G' . 

Our deterministic strategy defines a permutation 7r on the vertices {1, . . . , n} 
of G. The desired G' is then obtained as M.{G, it). Let the ordered set V n =< 
Vi, . . . ,v n > denote the final permutation given by it. We construct V n in a 
step by step fashion. At the end of step i, we have already defined the first 
i elements of the permutation, namely the ordered set Vi =< vi,...,Vi >, 
where each Vj is distinct. Let Vo denote the empty set. Having obtained V, for 
i > 0, we compute Vi + i in the next step as follows. 

Given an ordered set Vi of i vertices < v i, t>2, . . . , v i >, let Vi o u denote the 
ordered set of the i + 1 vertices < v i, t>2, . . . , v «, u >. (We will abuse notation 
and use Vi to denote the underlying unordered set also, when there is no chance 
of confusion.) Let VqOu denote < u >. 

Consider an invocation of the randomized algorithm RAND whose output is 
denoted as G". For each e G H, let x e denote the indicator random variable 
which is 1 if e ^ E(G"), and otherwise. Let X H = J2eeH x e- 

Given an ordered set S =< ui, . . . ,u r >, let 2(S) denote the event that the 
first r elements of the random permutation generated by RAND is given by 
the ordered set S =< Ui,...,u r >. Note that PrLE(Vo)] = 1 since the first 
elements of any permutation is the empty set Vq. 

Let x e \Z(Vi) denote the indicator random variable corresponding to x e condi- 
tioned on the event Z(Vj). 

Similarly, let the random variable Xh\Z{V,) denote \E(G") fl H\ conditioned 
on the event Z{Vi). 

For % > 0, let f e {V) denote Pr[x e = 1 | Zty)] and let F(^) denote E[X H \ Z{V)}- 

Note that f e (V ) denote Pr[i e = 1] and F(V ) denote E[X H ]. 

Clearly 

F{V) = E feffl. 

By LemmaEl we know that for any e E H, / e (Vo) > ^r^- Thus F(Vq) > . 
Clearly, 

E[X H \Z(Vi)] = —^— £ E[X H \Z(Vou)}. 

I V Vi \ udV-Vi 
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Let u G V — Vi be such that 



E[X H \Z(ViOu)] 



wev-Vi 



max E[X H \Z(Vi<>w)]. 



Define V 



i+l — 



Vi<>u. It follows that 



F{V i+1 ) = E[X H \Z(V i+1 )} > E[X H \Z(Vi)] = F{Vi). 



In particular, it is also true that F(V\) > F(Vq). 

After n steps, we obtain the final permutation V n . Applying the above inequal- 
ity n times, it follows that 

E[X H \Z(V n )\ = F(V n ) > F(V ). 

Recalling that F{V ) > |fj, we have F(V n ) > 

Let 7r be the permutation that corresponds to the ordered set V n . The final 
interval super graph G' output by our deterministic strategy is Ai(G, it). Note 
that, F(V n ) = Hj[Xij\Z(V n )] is the same as \E{G') H H\. Thus we have shown 
that \E(CF) C\H\> -£^\H\ as claimed. 

It remains to show that the above deterministic strategy takes only polynomial 
time. This can be shown as follows using Lemma [TT1 (see below). Recall that, 
given a vertex w G V — V iy F(V, o w) is simply J2e£H fe(Vi ° w )- It follows 
from Lemma [TT] that F(Vi o w) can be computed in polynomial time. Recall 
that given Vi, Vi + \ is Vi o u where u maximizes F(Vi o w) where w G V — Vi . 
Clearly such a u can also be found in polynomial time. Since there are only n 
steps before computing V n , the overall running time is still polynomial. ■ 

Lemma 11 For any ordered set Vi =< V\, • • • ,v% > and any e G H, f e {V,) 
can be computed in polynomial time. 

PROOF. Let e = (u,v). We can compute f e (V) as follows: 

The easiest case is when both u,v are in V{. In this case, the intervals corre- 
sponding to u and v , namely I(u) and I(v) are already defined. (Recall how 
Ai(G, 7r) defines an interval supergraph of G. See definition [3]). Therefore ei- 
ther e G E{G') or e ^ E(G'). Therefore the conditional probability is either 
or 1. This is summarized as: 



Case 1.1: u,v eVi and I(u) D I(v) = 0. 



Then f e (Vi) = 1 since (u,v) <£ E{G'). 



Case 1.2: u,v e V and I{u) n I{v) ^ 0: 
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Then f e (Vi) = since (u,v) G E(G'). 



Next let us consider the case u G Vi and v £ Vi. In this case I(u) = [l(u),r(u)] 
is already defined but I(v) is not yet determined since r(v) is still random. But 
it is clear that r(v) > r(u). Moreover, we can determine whether l(v) < r{u) 
or not, irrespective of r(v): If there is a vertex w G N(v) fl V, such that 
r(w) < r(u), then l(v) < r(u), otherwise /(f) > r(u). Thus in this case also 
the conditional probability is either or 1, as computed by the following 
procedure: 

Case 2.1: u G V h v V and N(v) n VJ = 0: 

Then f e {V) = 1. 

Case 2.2: ue^v^ and /V(f ) DVj^ 0: 

Let t = min weAr („) nyi r(w) . 
If r(«) > * then / e (V|) = else / e (^) = 1. 

Now we examine the case when both u and f do not belong to V$. First 
consider the sub case when both N{u) fl Vi and N(v) fl are non-empty. 
Then clearly I(u) fl /(f) ^ 0. This is because of the following. Let us denote 
by the ith vertex (the last vertex) in V*. Clearly, /(-a) < r(fj) < r(u) and 
/(f) < r(vi) < r(v). Thus r(fj) G /(-a) fl /(f). It follows that in this case the 
conditional probability is as give below. 

Case 3.1: u,v eV - V and N(v) n V ^ and 7V(n) nV^ 0: 

Then / e (^) = 0. 

Next subcase is when N(u) fl Vi and N(v) fl are both empty. That is, the 
set X = {u} U N(u) U {v } U 7V(f) has empty intersection with Vi. Let ir x 
be the projection of 7r onto X. Since X fl Vi = 0, n x can be any possible 
permutation of \X\ elements with equal probability, namely j-^. We estimate 
the conditional probability f e (Vi) as follows. Clearly, (n, v) ^ E(G') if and only 
if n x (u) < mm we{v}uN ( v )TT X (w) or n x (v) < mm we{u } UN ( u )ir x (w). It is easy 
to see that the probability for the above condition to hold is d< J^ +2 + j(vj+2 ' 
Thus, we have the following case: 

Case 3.2: u,v eV - V and N(v) H V = and N(u) n V t = 0: 

Th en /e(V^) = ^ + ^ " 

Now we are left with the last sub case: N(v) D V* ^ and 7V(w) n V* = 0. Let 
X = {n} U 7V(u) U {v }. Note that X fl Vi = 0. Clearly ir x can be any possible 
permutation of \X\ elements with equal probability, namely j^jj = ( d (Jj +2 y. • 
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We estimate the conditional probability f e {Vi) as follows. 



We claim that that (u,v) ^ E(G') if and only if irx(v) = 1. To see this, 
first observe that if (u,v) E{G') then ttx(v) < ttx(u) because otherwise 
r(u) < r(v) and since N(v) D Vi ^ 0, we get further l{y) < r{u) < r(v) leading 
to a contradiction. Therefore 1 < tcx{v) < ^x{u). Now, if 7Tx(w) = 1 for some 
w G N(u) then l{u) < r(v) < r(u) and thus I(u) fl I(v) ^ leading to a 
contradiction. On the other hand, if ttx(v) = 1 then clearly, r(v) < r(w) for 
any w G N(u) U {u}. It follows that r(v) < l(u) and therefore I{v) fl J(u) = 0. 

Now, having shown that (u,v) ^ E(G') if and only if 7Tx(v) = 1, we can 
compute / e (Vi) by computing the conditional probability that tt x (v) = 1, 
which is simply |^^|| = „ J +2 . Thus we have the following final case: 

Case 5.5: u, v G V - V { and N(v) n ^ and JV(«) n ^ = 0: 
Then / e (K) = 



Theorem 12 Lei G be a graph on n vertices with maximum degree A. The 
box representation of G in [(A + 2) Inn] dimension can be constructed in 
polynomial time. 



PROOF. Let h = \E(G)\. It follows from Lemma fTUl that we can construct 
t interval graphs such that the number of edges of E(G) which is present in 

all of these t interval graphs is at most (^2) h. If (2^5) h < 1, then we are 
done. That is, we are done if tin +ln < is true. Clearly this is true, if 

t > ln/i/ln (^2^\ We can assume that A > 1. If A < 1 then G is an interval 
graph and the theorem trivially holds. Using the fact that In > 4 — 1(^) 2 , 
we obtain box(G) < 2 (A-i) ^ n ^ — + 2) Inn. By Lemma [TUl each interval 
graph is constructed in polynomial time. Hence the total running time is still 
polynomial. Thus the theorem follows. ■ 



4 In terms of average degree 



It is natural to ask whether our upper bound of (A + 2) In n still holds, if we 
replace A by the average degree d av . Unfortunately this is not true in general: 
we show below an infinite family of graphs where the boxicity is exponentially 
higher than (d av + 2) In n. Construct a graph G on n vertices as follows. First 
take a Roberts graph on n x vertices where n x < n. (Refer to section I2TT1 for the 
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definition of Roberts graph.) Let the remaining n — n\ vertices form a path 
which is connected by an edge to one of the vertices of the Roberts graph. The 
average degree of G is d av = (nx(n\ — 2) + 2(n — rii))/n, whereas its boxicity 
is at least ni/2 > \\Jn(d av — 2) . It is easy to see that the boxicity of G is 
exponentially larger than (d av + 2) Inn for example when n\ = 9(^/n). 

Nevertheless we show the following general upper bound for boxicity in terms 
of d av . In fact, we will express the upper bound in terms of n and the number 
of edges m. 

Theorem 13 For a connected graph G on n vertices and m edges, box(G) < 
5vr7ilnn. Moreover, there exists a connected graph G with n vertices and m 



edges such that box(G) > 




PROOF. We show the upper bound as follows. Let x = ym/lnn. Let V 
denote the set of vertices in G whose degree is at least x. It is straightforward 
to verify that \V'\ < 2m/x. Let G" be the induced subgraph on V — V. 
Each vertex in G" has degree at most x. By Corollary [71 we obtain that 
box(G") < (x+2) Inn. Since box(Cr") + | V'| is a trivial upper bound for box(G), 
it follows that box(G) < (x + 2) \ian + 2m/x < 5Vm Inn since m > n — 1. The 
example graph discussed in the beginning of this section serves as the example 
that illustrates the lower bound. ■ 



4-1 Boxicity of Random Graphs 



Though in general boxicity of a graph is not upper bound by (d av + 2) Inn, 
where d av is its average degree, we now show that for almost all graphs, the 
boxicity is 0(d av Inn). 

We show that for almost all graphs in the the Q(n,m) model, box(G) is 
0(c\nn) where c = 2m/ n. We shall only consider connected graphs here 
and so assume that m > n — 1. Thus, we have, c > 1. But we first show the 
result for the Q(n,p) model setting p = c/(n — 1). As shown in [6], we can 
then carry over the result to the Q(n,m) model since p = mj 

Consider the Q(n,p) model with p = c/(n — 1). Let G denote a random graph 
drawn according to this model. For a vertex u, define a random variable d u 
that denotes the degree of u, i.e., d u = \N(u)\ = J2 v ev{G),v^u e u,v where e u>v is 
an indicator random variable whose value is 1 if (u, v) G E(G) and otherwise. 
Therefore, E[d u ] = p(n — 1) = c. 

Case 1: c > Inn. 
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Since d u is the sum of independent Bernoulli random variables, we can use 
Chernoff bound to bound the probability of d u becoming large. In particular, 
we use the following form of the Chernoff bound given in [22] for the rest of 
the proof. 

<5 2 EJX1 

PrLY > (1 + 5)E[X)] < e-^w (1) 

for all 5 > 1. Taking 5 = 5, we get, Pr[d„ > 6c] < 1/n 3 . Now, by union bound, 
it follows that Pr[A(G) > 6c] = Pr[3w E V(G),d u > 6c] < 1/n 2 . Using the 
result box(G) < (A + 2) Inn, we now have, box(G) < (6c + 2) Inn with 
probability at least 1 — 1/n 2 . 

Case 2: c < Inn. 

Let S u = V(G) - N(u) - {u}. 

Let N'{u) = {v E S u | 3u' E N(u) such that (u',v) E E(G)}. 
In this case, we will use a different technique to upper bound boxicity. Let the 
graph G 2 denote the square of G. That is, V(G 2 ) = V(G) and (it, v) E E{G 2 ) 
if there is a path of length 1 or 2 between u and v. The authors showed in [23] 
that the boxicity of a graph G is at most 2A(G 2 ) + 2. We will show below that 
if c < Inn, then A(G 2 ) < c + 6hm + 7c 2 + 42clnn, with high probability. The 
reader may note that the degree of a vertex u in G 2 equals \N(u) \ + \N'(u) \ . We 
will now show that for any vertex u, Pr[\N(u)\ + \N'(u)\ ^ O(clogn)] < 3/n 3 . 

Let k = c + 6 Inn. We apply Chernoff bound ([1]) with 5 = 6 In n/c to obtain 

Pr[d u > k] < e ^(^)/(2+s) < 1/n 3 

Let A C V(G) such that \A\ < k. Let Z(A) denote the event that N(u) = A. 
Now, for each vertex v E S u , let X v ^ denote an indicator random variable 
indicating whether v E N'(u) conditioned on the event Z(A). Note that for 
any vertex v E S u , Pr[X v ^ — 1] < kp. Let Xa = J2 v es u Xv,A- It follows that 
E[Xyi] < kp(n— 1) = kc. Since Xa is the sum of independent Bernoulli random 
variables, we apply the Chernoff bound (TjQ) by fixing 5 = 6A;c/E[Xa] to obtain 
Pr[X A > 7kc] < e -mc)/(2+S) < X j n z 

Let the random variable X u = \N'(u)\. We now have, 

Pr[X u > 7kc \d u <k]= Y Pr\(X„ > Ike) A Z(A)] 

ACV(G),\A\<k 

Pr[X A > 7kc] Pr[Z(A)} < 1/n 3 

ACV(G),\A\<k 

It follows that 
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Pr[X u > 7kc] = Pr[X u > 7kc \ d u < k] Pr[d u < k] 
+Pr[X u > 7kc \d u >k] Pr[d u > k] 
< (l/n 3 )Pr[d u <k} + (l/n 3 )Pr[X u > 7kc \d u >k] < 2/n 3 

Let t u = \N(u) \ + \N'(u)\ = d u + X u . Combining the bounds on the values of 
d u and X u , we get, 

Pr[t u >k + 7kc] < Pr[d u > k] + Pv[X u > 7kc] < 3/n 3 

Observe that A(G 2 ) = max u( zGt u . Thus, by applying union bound, we obtain 



Pr 



A(G 2 ) > k + 7kc 



Pr 



\J t u >k + 7kc 
ueV(G) 



< 3/n 2 



Thus, with high probability, A(G 2 ) < k + 7kc = c + 6 Inn + 7c 2 + 42clnn. 
Recalling that box(G) < 2A(G 2 ) + 2, we obtain box(G) G O(clnn) with high 
probability, since c < Inn. 

Having shown that in the G(n,p) model, Pr[box(G) ^ O(clnn)] < 3/n 2 , the 
following relation from page 35 of [6] helps us to extend our result to the 
G(n, m) model. 

P m (Q) < 3m l ^ 2 P p (Q) 
where Q is a property of graphs of order n, and P m (Q) and P P (Q) are the 
probabilities of a graph chosen at random from the Q(n,m) or the Q(n,p) 



models respectively to have property Q given that p 



m 



Using this 



result, we now have, for a graph G drawn randomly from the G(n, m) model, 

Pr[box(G) O(clnn)] < 9n' 2 ^/m < 9/n 

As c = 2m /n = d av , which is the average degree, we have shown that for 
almost all graphs with a given average degree d av , the boxicity is 0{d av \nn). 

Thus we have the following theorem: 

Theorem 14 For a random graph G on n vertices and m edges drawn ac- 
cording to Q (n, m) model, 



Pr 



2/77 

box(G) = 0[— Inn 



>i-i 

n 
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